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GENTNER, DEDRE. Metaphor as Structure Mapping: The Relational Shift. CHILD DEVELOPMENT, 
1988, 59, 47-59. The goal of this research is to clarify the development of metaphor by using 
structure-mapping theory to make distinctions among kinds of metaphors. In particular, it is pro- 
posed that children can understand metaphors based on shared object attributes before those based 
on shared relational structure. This predicts (1) early ability to interpret metaphors based on shared 
attributes, (2) a developmental increase in ability to interpret metaphors based on shared relational 
structure, and (3) a shift from primarily attributional to primarily relational interpretations for meta- 
phors that can be understood in either way. 2 experiments were performed to test these claims. 
There were 3 kinds of metaphors, varying in whether the shared information forming the basis for 
the interpretation was attributional, relational, or both. In Experiment 1, children aged 5—6 and 9- 
10 and adults produced interpretations of the 3 types of metaphors. The attributionality and relation- 
ality of their interpretations were scored by independent judges. In Experiment 2, children aged 4— 
5 and 7-8 and adults chose which of 2 interpretations—relational or attributional—of a metaphor 
they preferred. In both experiments, relational responding increased significantly with age, but 
attributional responding did not. These results indicate a developmental shift toward a focus on 
relational structure in metaphor interpretation. 


The study of metaphoric development is 
fraught with contradictions. Evidence that 
young children are quite poor at metaphoric 
interpretation sits side by side with equally 
compelling evidence that they are uniquely 
talented at metaphoric language. In experi- 
mental studies of metaphor comprehension, 
young children typically perform quite 
poorly. A 4-year-old asked “Can a person be 
sweet?” answers literally (e.g., “Not unless he 
was made out of chocolate”) (Asch & Nerlove, 
1960). Young children do badly at matching 
sentences with metaphorically related pic- 
tures (Kogan, 1975) and at choosing appropri- 
ate metaphorical completions for sentences 
(Gardner, Kircher, Winner, & Perkins, 1975). 
Not until 14 years of age can children explain 
metaphors such as “the prison guard was a 
hard rock.” These and many other experimen- 
tal results seem to indicate that metaphorical 
ability develops very slowly. Indeed, until re- 
cently, the dominant position was that meta- 
phor develops only after the child has ac- 
quired basic competence at literal language 
(e.g., Inhelder & Piaget, 1958). 


This dour assessment contrasts sharply 
with the impression gained from children’s 
spontaneous speech, which is so full of cre- 
ative metaphors that some observers have 
viewed early childhood as a period of linguis- 
tic genius (Gardner, 1974). For example, a 15- 
month-old girl used “moon” to refer not only 
to the moon but to a half grapefruit and a 
hangnail (Bowerman, 1982). A 2-year-old boy 
I observed remarked that a crescent moon 
was “bent, like a banana,” and on another oc- 
casion jumped into a pile of pillows and an- 
nounced “leafs.” It is unlikely that all such 
extensions could be accounted for as errors 
(Thomson & Chapman, 1975). This seems to 
leave us with the paradox that young children 
can produce metaphors on their own but can- 
not comprehend them in experimental stud- 
ies. Part of the disparity may be due to the 
kinds of tasks used to assess comprehension. 
For example, children can demonstrate meta- 
phoric ability earlier with enactment tasks 
than with verbal explanation tasks (Gentner, 
1977a, 1977b; Pollio & Pickens, 1980; Vos- 
niadou & Ortony, 1986). But even with age- 
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appropriate methodology, young children 
perform far less well in comprehension tasks 
than their fluent production would lead us to 
expect (e.g., Gentner & Toupin, 1986). 


I suggest that an essential step in sorting 
out the developmental picture is clarifying 
what we mean by “metaphor.” The term cov- 
ers a number of different kinds of compari- 
sons, varying in their complexity and in the 
nature of the commonalities they convey 
(Lakoff & Johnson, 1980). In this article I use 
the structure-mapping theory of analogy to 
derive distinctions among kinds of metaphors 
(Gentner, 1980, 1983, 1986). To preview the 
conclusions, I will argue that children can 
produce and comprehend metaphors based 
on common object attributes before meta- 
phors based on common relational structure. 


Structure mapping.—The basic intuition 
of structure-mapping theory is that an analogy 
is a mapping of knowledge from one domain 
(the base) into another (the target) which con- 
veys that a system of relations that holds 
among the base objects also holds among the 
target objects. Thus an analogy is a way of 
noticing relational commonalities indepen- 
dently of the objects to which those relations 
apply. For example, Carnot in 1824 explained 
heat flow by analogy with a waterfall. The 
analogy conveys that just as a gradient from a 
high level to a low level will cause water to 
flow, given a path, so a gradient from a high 
temperature to a low temperature will cause 
heat to flow, given a heat path. This is a typi- 
cal analogy in that the higher-order relational 
structures are identical in base and target if 
the proper low-order correspondences among 
objects and functions are made (i.e., water > 
heat; level — temperature; and water path > 


heat path). 


In interpreting an analogy, people seek to 
put the objects of the base in one-to-one cor- 
respondence with the objects of the target so as 
to obtain maximum structural match.! The 
corresponding objects in the base and target 
do not have to resemble each other at all; ob- 
ject correspondences are determined by roles 
in the matching relational structures. Central 
to the mapping process is the principle of sys- 
tematicity: People prefer to map systems of 
predicates linked by higher-order relations 
with inferential import rather than to map iso- 
lated predicates. The systematicity principle 


is a structural expression of our tacit prefer- 
ence for coherence and deductive power in 
interpreting analogy. Besides analogy, other 
kinds of similarity matches can be distin- 
guished in this framework, according to 
whether the match is one of relational struc- 
ture, object descriptions, or both. As noted 
above, analogies discard object descriptions 
and map relational structure. Mere appear- 
ance matches are the opposite: They map as- 
pects of object descriptions and discard rela- 
tional structure. Literal similarity matches 
map both relational structure and object de- 
scriptions. 


Metaphor.—Now let us apply this frame- 
work to metaphor. Metaphors can be divided 
into four partially overlapping categories: at- 
tributional metaphors, relational metaphors, 
double metaphors, and a category of complex 
metaphors that resist analysis as one-one map- 
pings and that I will not consider here (Gent- 
ner, Falkenhainer, & Skorstad, 1987). Attribu- 
tional metaphors? (e.g., “Her arms were like 
twin swans’) are mere appearance matches: 
they convey common object attributes. Here, 
the attributes “long,” “thin,” and “graceful” 
can be mapped from the base domain of 
swans to the target domain of her arms. Rela- 
tional metaphors (e.g., Shakespeare’s “Look, 
he’s winding up the watch of his wit; by and 
by it will strike”) can be analyzed as analo- 
gies: they convey that the base and target 
share common relational structure. Here, the 
intended commonalities have nothing to do 
with the object attributes of a watch—a glass 
face, metal cogs, and so on; instead, the meta- 
phor conveys the common relational structure 
of a person setting a mechanism that will later 
produce seemingly spontaneous external ef- 
fects. Finally, besides pure relational and at- 
tributional metaphors, there are also double 
metaphors that are mixtures of the two, such 
as Verbrugge and McCarrell’s (1977) exam- 
ple, “Plant stems are drinking straws for 
thirsty trees.” This metaphor conveys both 
the common attributes “long, thin, tubular” 
and the common relational structure “raises 
fluids from a lower to a higher place in order 
to benefit some creature.” 


Adults have been found to prefer rela- 
tional metaphors over attributional metaphors 
and to focus primarily on relational common- 
alities in interpreting metaphors (Gentner, 


' The version of structure mapping presented here is a descriptive theory. A computer simula- 
tion based on a process model also exists (Falkenhainer, Forbus, & Gentner, 1986; Gentner, Fal- 


kenhainer, & Skorstad, 1987). 


? For present purposes, metaphor (“An X is a Y”) and simile (“An X is like a Y”) will be classed 


together. 
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1986; Gentner & Landers, 1985). In a prior 
study on which the present research is based, 
subjects produced interpretations of meta- 
phors and rated their aptness, having first 
written out descriptions of the objects con- 
tained in the metaphors (Gentner, 1986). The 
results indicated a relational focus in meta- 
phor interpretation in two ways. First, al- 
though the object descriptions contained both 
object attributes and relational information, 
only the relational information survived into 
the metaphor interpretations. For example, 
one subject described “cigarette” as follows: 
“chopped cured tobacco in a paper roll / with 
or without a filter at the end/held in the 
mouth / lit with a match and breathed through 
to draw smoke into the lungs / found widely 
among humans/known by some cultures 
to be damaging to the lungs / once consid- 
ered beneficial to health.” This description 
contains both relational and attributional in- 
formation. But the same person, when inter- 
preting the metaphor “Cigarettes are like 
time bombs,” included only relational infor- 
mation: “They do their damage after some pe- 
riod of time during which no damage may be 
evident.” The second indication of relational 
focus is that subjects considered metaphors 
more apt when they could find relational in- 
terpretations: Subjects’ aptness ratings for 
metaphors correlated positively with the 
judged relationality of their interpretations. In 
contrast, attributional commonalities did not 
contribute to aptness: The correlation be- 
tween aptness ratings and attributionality was 
nonsignificant and negatively trending. 
Adults thus demonstrate a relational focus 
both in interpreting and in judging metaphor. 


Do children show this same relational fo- 
cus? Although research on metaphorical de- 
velopment has not in general addressed this 
question, there is reason to suspect that they 
do not. Billow (1975) asked children to inter- 
pret two kinds of metaphors: “similarity meta- 
phors,” based on shared attributes (e.g., “Hair 
is spaghetti”) and “proportional metaphors,” 
based on a class of shared relations (e.g., “My 
head is an apple without any core’). Children 
9—14 years old were much better at interpret- 
ing the similarity metaphors than the propor- 
tional metaphors. Further, when given simi- 
larity metaphors even 5-year-olds could 
produce sensible interpretations almost 30% 
of the time. (They were not given the propor- 
tional metaphors.) These results suggest that 
attribute matches may be developmentally 
prior to relational matches. If this is true, then 
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many of the apparent paradoxes in the litera- 
ture on metaphor development may be trace- 
able to different developmental patterns for 
different kinds of metaphorical matches. The 
present research investigates this possibility. 


Experiment 1 


Experiment 1 was a straightforward ex- 
tension of the method used in the Gentner 
(1986) study. Children and adults interpreted 
metaphors of different types and rated the 
aptness of the metaphors. Their interpreta- 
tions were then scored by independent 
judges for relationality and attributionality 
(see Gentner & Stuart, 1983, for further de- 
tails).> There were three metaphor types: (1) 
attributional metaphors (e.g., “Pancakes are 
nickels” [both are round and flat]); (2) rela- 
tional metaphors (e.g., “A tire is a shoe” [both 
are used by moving figures as points of con- 
tact with the ground]); and (3) double meta- 
phors, with both attributes and relations in 
common (e.g., “Plant stems are drinking 
straws” [both are long and cylindrical; both 
are used to bring liquids from below to nour- 
ish a living thing]). 


If adults follow the predicted relational 
strategy, their responses should show three 
characteristics. First, relational and double 
metaphors should be interpreted relationally. 
(This prediction does not apply to the attri- 
bute metaphors because they are designed 
not to have legitimate relational interpreta- 
tions.) Second, adults should prefer meta- 
phors that allow relational interpretations, so 
their aptmess ratings should be higher for rela- 
tional and double metaphors than for attribute 
metaphors. Finally, adults’ aptness ratings for 
metaphors should be positively correlated 
with the relationality of their metaphor inter- 
pretations but not with the attributionality..To 
the extent that children fail to follow a rela- 
tional interpretation strategy, their results will 
differ from this pattern. 


The experimental procedure was aimed 
at minimizing the influence of other develop- 
mental changes besides metaphorical devel- 
opment. The first potential confound is differ- 
ences in pragmatic task knowledge: Young 
children may not know when they are sup- 
posed to give metaphorical answers. To cir- 
cumvent such differences, a series of amplifi- 
cations was used to make the task clear. A 
second possible confound is age differences 
in domain knowledge. To minimize such dif- 
ferences, only domains likely to be familiar to 


3 This experiment was conducted by P. V. Stuart as an undergraduate honors project at UCSD. 
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young children were used. Finally, to rule out 
differences in prior exposure to specific items, 
all comparisons were novel; no idioms or con- 
ventional metaphors were used. 


Method 


Subjects ——There were 10 subjects from 
each of three age groups: 5—6-year-olds (5 
boys and 5 girls), 9—10-year-olds (4 boys and 6 
girls), and college students from psychology 
classes at the University of California at San 
Diego (7 males and 3 females). Children were 
recruited from schools in Del Mar and La 
Jolla, California. 


Materials and design.—There were eight 
instances each of three types of metaphor, as 
shown in Table 1: (1) attributional metaphors 
(A metaphors), in which base and target 
shared many attributes but few relations; (2) 
relational metaphors (R metaphors), in which 
base and target shared relational structure but 
few attributes; and (3) double metaphors (D 
metaphors), in which base and target shared 
both relations and attributes. Thus, each sub- 
ject interpreted 48 objects and 24 metaphors. 
The design was age (3 levels) x metaphor 
type (R, A, or D). 


Procedure.—The task was administered 
to the adults in written form, in groups. Sub- 
jects were first asked to describe the separate 


objects that later appeared in the metaphors. 
The objects were presented in semirandom 
order, with pairs from the same metaphor sep- 
arated. All subjects, including children, were 
able to demonstrate basic familiarity with all 
of the terms used. After completing the object 
descriptions, subjects were given the meta- 
phors. They were asked to write their inter- 
pretations of what the metaphors were in- 
tended to convey and also to rate the aptness 
of each metaphor (i.e., how clever, interest- 
ing, or worthwhile it was) on a 1-5 scale. 
They also rated the metaphoricity of each 
comparison (i.e., the degree to which it ex- 
pressed literal similarity as opposed to meta- 
phorical relatedness). Metaphoricity ratings 
were not elicited from children and are not 
considered further here. Order of presenta- 
tion was semirandom, with the constraint that 
metaphors of the same type were separated. 


The task was administered to children or- 
ally and individually, and responses were 
tape-recorded. Children were first asked to 
describe the 48 objects. Then they were 
asked to interpret the metaphors. A graded — 
series of amplifications was used to be sure 
that children understood the task. The tech- 
nique was essentially one of restating meta- 
phors as similes, based on Reynolds and Or- 
tony’s (1980) finding that young children 
perform better with similes (“X is like a Y’) 


TABLE 1 


MATERIALS USED IN EXPERIMENT 1 


Metaphor Type 


Examples 


Relational metaphors....... 


The moon is like a lightbulb. 


A camera is like a tape recorder. 
A ladder is like a hill. 

A cloud is like a sponge. 

A roof is like a hat. 

Treebark is like skin. 

A tire is like a shoe. 

A window is like an eye. 


Attributive metaphors ...... 


Jelly beans are like balloons. 


A cloud is like a marshmallow. 

A football is like an egg. 

The sun is like an orange. 

A snake is like a hose. 

Soapsuds are like whipped cream. 
Pancakes are like nickels. 

A tiger is like a zebra. 


Double metaphors ......... 


A doctor is like a repairman. 


A kite is like a bird. 

The sky is like the ocean. 

A hummingbird is like a helicopter. 
Plant stems are like drinking straws. 
A lake is like a mirror. 

Grass is like hair. 

Stars are like diamonds. 


than with metaphors (“X is a Y’’). The experi- 
menter would ask, “Is a hummingbird a heli- 
copter?” If the child responded literally (e.g., 
“No, a hummingbird is a bird’’), the experi- 
menter asked, “What does it mean if I say ‘A 
hummingbird is a helicopter’?” If the child 
still responded literally, the experimenter 
asked, “What does it mean if I say ‘A hum- 
mingbird is like a helicopter?” If the child 
maintained a literal response after this third 
question, the literal response was recorded 
and the experimenter went on to the next 
metaphor. This technique was quite success- 
ful, eliciting nonliteral responses in all but a 
few cases. Moreover, after a few such se- 
quences, most children caught on that a non- 
literal response was desired in this task and 
thereafter produced nonliteral responses im- 
mediately for each new metaphor. After inter- 
preting the metaphor, children rated its apt- 
ness by pointing to one of five schematic 
faces, ranging from very sad (low aptness) to 
very happy (high aptness). They were told to 
point to the happy face if they thought the 
metaphor was interesting and to the sad face 
if they thought it was dull or boring. 


Scoring.—The purpose of the scoring 
was to provide an unbiased assessment of the 
commonalities selected by the subjects in 
their metaphor interpretations: in particular, 
whether these commonalities were relational 
or attributional. Groups of from two to four 
advanced undergraduates who had received 
training in the use of propositional notation 
rated each metaphor interpretation on two 
separate five-point scales, an attributionality 
scale and a relationality scale. Attributionality 
and relationality were scored on separate 
scales to allow for the possibility that the 
same response might contain both kinds of 
information. For the attributionality rating, 
judges rated whether the response described 
objects in and of themselves (e.g., “X is yel- 
low,” or “X is a sphere’’). Most adjectives re- 
ceived high attributionality ratings, as did 
many concrete nouns. For the relationality 
rating, the judges scored whether the re- 
sponse described relations between objects 
(or, in the case of higher-order relations, be- 
tween other relations in the domain). Transi- 
tive verbs (e.g., “X chases Y,” or “X causes 
Y’), comparative adjectives (e.g., “X is longer 
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than Y’), and prepositions (e.g., “X is inside 
Y’’) tended to receive high relationality rat- 
ings.* The specific content of the relations 
could vary widely, including spatial, causal, 
functional, and other kinds of relations. 


Two further points must be made about 
the scoring. First, to minimize the effects of 
differences in length of responses, an inter- 
pretation was given a high rating on relation- 
ality if it included any clearly relational state- 
ment, and similarly for attributionality. This 
method is sensitive to the presence or ab- 
sence of relational (or attributional) informa- 
tion but not to the number of different rela- 
tions (or attributes) mentioned. Second, 
because of the partly subjective nature of the 
scoring task, it was important to remove po- 
tential sources of bias. Therefore the judges 
were not told the subjects’ aptness and meta- 
phoricity ratings, and only one of the judges 
was aware of the experimental hypotheses. 
To conceal the ages of the subjects, the meta- 
phor interpretations were scored in random 
order. Interrater agreement ranged from 85% 
to 100% on different metaphors. 


Results 

There was a marked developmental in- 
crease in the use of relational information in 
the metaphor interpretations. Table 2 shows 
sample interpretations produced at different 
ages, as well as the ratings they received. The 
mean rating of the relationality of metaphor 
interpretations increases steadily with age for 
both the R and D metaphors (Fig. 1, top). 
(This increase does not occur for the A meta- 
phors because they do not permit relational 
interpretation.) In contrast, the attributional- 
ity of the metaphor interpretations remains 
constant across age for all three classes of met- 
aphor (Fig. 1, bottom). In short, there is a spe- 
cific developmental increase in the relation- 
ality, but not the attributionality, of the 
metaphor interpretations. 


Two separate two-way 3 (age) x 3 (meta- 
phor type) analyses of variance were per- 
formed, one for the relationality ratings and 
one for the attributionality ratings. In the rela- 
tionality analysis, there was a main effect of 
age, confirming the developmental increase 
in use of relational information in interpreting 
metaphors, F(2,27) = 12.76, p < .01. The in- 


4 Although most scoring decisions were straightforward, there were some ambiguous cases. 
These generally involved relational adjectives such as edible and soporific or relational nouns such 
as father. These terms express relational information, yet they appear on the surface as adjectives 
(Miller, 1979). For example, the adjectival phrase “X is soporific” is on the surface a one-place 
predicate (soporific [X]). But its meaning is the relational proposition “X puts people to sleep.” 
Because of their ambiguous status, such relational adjectives were scored as intermediate (3) on both 


the relational and the attributional scale. 
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TABLE 2 


SAMPLE INTERPRETATIONS OF DIFFERENT CLASSES OF METAPHORS IN EXPERIMENT 1] 


Attributional Metaphor 


Relational Metaphor 


Double Metaphor 
(Plant stems are 


Age (The sun is an orange.) (A tire is a shoe.) drinking straws.) 
5~6 ....... They’re round and Sometimes your shoe is They’re both straight. 
orange. (1,5) black and the tire is (1,5) 
black. (1,5) 
9-10 ...... It’s like a circle and You can go places on Plant stems are thin and 
so is the sun. (1,5) both. (5,1) so are straws. (1,5) 
Adult...... Both are orange; both Both are coverings that They are both used for 


spherical. (1,5) 


come in direct contact 
with the terrain. (5,1) 


drawing in water. 
They are both tubu- 
lar. (5,5) 


NOTE.—The figures in parentheses give the rated relationality and attributionality of the response. 


teraction of age X metaphor type was also 
significant, reflecting the fact that the age in- 
crease in relationality occurs only for the R 
and D metaphors, F(4,54) = 5.48, p < .01. In 
the attributionality analysis, neither the main 
effect of age nor the age xX metaphor type 
interaction was significant, confirming that 
there was no developmental increase in at- 
tributionality. In both the.relationality and at- 
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Fic. 1.—Top, Results of Experiment 1: mean 
ratings of relationality for interpretations of differ- 
ent types of metaphor across age. Bottom, Mean 
ratings of attributionality for interpretations of dif- 
ferent types of metaphors across age. 


tributionality analyses, the main effect of met- 
aphor type was strongly significant, F(2,54) = 
191.63, p < .001, F(2,54) = 265.06, p < .001, 
respectively. For all ages, the a priori categor- 
ization of stimuli held up well: on relational- 
ity, R metaphors were highest, followed by D 
and then A metaphors; and on attributional- 
ity, A metaphors were highest, followed by D 
and then R metaphors. 


The performance on D metaphors is of 
special interest. By design, the D metaphors 
could support either an attributional or a rela- 
tional interpretation. The results show a rela- 
tional bias in the two older groups but not in 
the youngest children. Planned comparisons 
between the relationality ratings and the at- 
tributionality ratings of the D metaphors 
showed that, for the 9-year-olds and adults, 
the mean relationality is significantly greater 
than the mean attributionality, (9) = 2.78, p 
< .05, and ¢(9) = 3.79, p < .05, respectively. 
For the 5—6-year-olds, there is no significant 
difference, t(9) = 1.93, p > .05. 


Aptness ratings——As predicted, adults 
preferred metaphors for which they could 
find relational interpretations. Adults’ mean 
aptmess ratings for R (M = 2.86) and D (M = 
2.95) metaphors were considerably higher 
than for A metaphors (M = 2.30), t(79) = 
4.30, p < .001, for D versus A, and ¢(79) = 
3.87, p < .001, for R versus A metaphors. 
Children did not show this relational prefer- 
ence. For 5—6-year-olds, M = 2.08, 2.14, and 
2.26, and for 9-10-year-olds, M = 1.99, 2.18, 
and 2.19 for R, D, and A metaphors, respec- 
tively. Children’s aptness ratings did not dif- 
fer significantly across the three types of met- 
aphors. Another indication of relational focus 
in adults is that their aptness ratings were 
positively correlated with the relationality of 
their interpretations and negatively correlated 
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with the attributionality, r(22) = .55, p < .01, 
and, r(22) = -—.42, p < .05, respectively. 
Children did not show this pattern. Their apt- 
ness ratings showed no correlation with either 
relationality, r(22) = .15 for 5-year-olds and 
—.18 for 9-year-olds, or attributionality, r(22) 
= .01 for 5-year-olds and .29 for 9-year-olds. 
We must be cautious in evaluating the apt- 
ness data because the children may simply 
have lacked facility with the aptness scale. 
Still, the results suggest a developmental dif- 
ference in the criteria for judging aptness in 
metaphor. 


Discussion 

As predicted by structure-mapping the- 
ory, adults focused on mapping relational sys- 
tems. There are several indications of this pat- 
tern. First, adult interpretations for R and D 
metaphors were highly relational; this is es- 
pecially telling for D metaphors, which could 
have supported either kind of interpretation. 
Second, adults rated the R and D metaphors 
as more apt than the A metaphors. Third, 
adults’ aptness ratings correlated positively 
with the relationality of their interpretations 
but negatively with attributionality. This pat- 
tern replicates and strengthens the positive 
correlation between aptmess and relationality 
found in prior research with adults (Gentner, 
1986). Adults appear both to seek relational 
predicates in metaphorical mapping and to 
judge the aptness of a comparison according 
to the relationality of the mapping. 


Young children did not share this rela- 
tional focus. The 5-—6-year-olds produced 
fewer relational interpretations of R and D 
metaphors than the older groups, and they 
showed no tendency to produce relational 
rather than attributional interpretations for 
the D metaphors. There was a marked in- 
crease in relationality with age. This develop- 
mental trend was specific to relational infor- 
mation; there was no age increase in 
attributionality. Turning to aptness, one might 
have suspected that children, though not flu- 
ent enough to produce relational interpreta- 
tions, would nevertheless show an apprecia- 
tion of relationality in their aptness ratings. 
But this was not the case. Children’s aptness 
ratings reflected no preference for R and D 
metaphors over A metaphors. Further, there 
was no correlation between children’s apt- 
ness ratings for metaphors and the relational- 
ity of their own interpretations. 


Overall, these results appear to show a 
clear developmental increase in relational fo- 
cus. But there were some possible problems 
with the methodology used in Experiment 1. 
One limitation was the use of verbal explana- 
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tions as responses. Children’s ability to give 
an explicit verbal account of their knowledge 
typically lags behind their ability to demon- 
strate that knowledge in other ways (e.g., Pol- 
lio & Pickens, 1980; Vosniadou & Ortony, 
1983, 1986). A second concern was the apt- 
ness-rating task. As discussed above, it was 
not clear that children understood the sad-to- 
smiling face scale. To address these prob- 
lems, a second experiment was performed us- 
ing a choice task rather than a production task. 
Children were presented with two interpreta- 
tions—one relational and one attributional— 
for each metaphor and were asked to choose 
the interpretation they preferred. In this way, 
young children could express a preference for 
relational interpretations without having to 
generate them. After this choice task, a rating 
task was given, using a new method to elicit 
the ratings. Children were asked to indicate 
their ratings on a “goodness meter,” a vertical 
scale something like a thermometer. Finally, 
one other difference between the two studies 
was that, because Experiment 2 used a com- 
prehension paradigm, slightly younger chil- 
dren could be tested than in Experiment 1. 


Experiment 2 


Method 

Subjects.—There were 36 adult subjects, 
all University of Illinois undergraduates, and 
36 children: 18 4—5-year-olds, with a mean 
age of 4-10, and 18 7—8-year-olds, with a 
mean age of 7-11. They were approximately 
evenly divided between males and females. 


Materials.—Eighteen metaphors were 
used: eight R metaphors, eight D metaphors, 
and two A metaphors. The metaphors were 
taken from Experiment 1 (see Table 1), except 
that two of the D metaphors from Experiment 
1 were judged unsatisfactory and replaced: 
the metaphors “A lake is like a mirror” and 
“Stars are like diamonds” were replaced by 
“Blood is like water” and “A submarine is 
like a whale.” Because it proved to be 
difficult to design plausible relational inter- 
pretations for the A metaphors, only two A 
metaphors were used, both taken from Exper- 
iment 1: “Jelly beans are like balloons” and 
“A snake is like a hose.”” Due to experimenter 
error, the two new D metaphors were omitted 
in the adult version of the choice task, leaving 
only 16 metaphors: eight R, six D, and two A 
metaphors. 


For each metaphor, two relational and 
two attributional interpretations were de- 
vised, based on the interpretations generated 
by subjects in Experiment 1. Three criteria 
were applied in selecting among the re- 
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TABLE 3 


SAMPLE METAPHORS AND INTERPRETATIONS USED IN EXPERIMENT 2 


Samples 


Interpretations 


Relational metaphors: 
A cloud is like a 


sponge R1) Both can hold water 


R2) Both can give off water 


A tire is like a 

ShOC* 5.2505 030.88 
Double metaphors: 

Plant stems are like 

drinking straws .. 

R2) Both draw liquid up 


Grass is like hair... R1) Both grow quickly 


R2) Both cover and protect something 


Attributional metaphors: 
Jelly beans are like 
balloons 


R1) Both are fun at parties 


R2) Both can be tossed in the air 


Rl) Both squirt liquid 
R2) Both can curl up 


hose 


Rl) Both are used to move something 
R2) Both cover the bottom of something 


Rl) Both can be used to get water 


Al) Both are soft 
A2) Both are fluffy 


Al) Both are made of rubber 
A2) Both can be black 


Al) Both are long 

A2) Both are thin 

Al) Both are long 

A2) Both are skinny 

Al) Both can be different colors 
A2) Both are round 


Al) Both are long 
A2) Both are wiggley 


NoTE.—R1 and R2 denote relational interpretations, and Al and A2 denote attributional interpretations. 


sponses generated in Experiment 1. First, to 
be sure that the responses were either clearly 
relational or clearly attributional, the selec- 
tion was limited to interpretations that had 
received scores of 4 or better from the judges 
in Experiment 1. Second, among the interpre- 
tations that met the first criterion, the two 
most frequent relational interpretations and 
the two most frequent attributignal interpreta- 
tions were selected. Third, because it was 
crucial that young children understand the 
choices, in tallying the responses those of 5— 
6-year-olds were given the most weight. For a 
few metaphors, there were not enough re- 
sponses from Experiment 1 that met the crite- 
ria. In this case, I either used a less frequent 
response or invented an extra response to 
bring the total up to two attributional and two 
relational interpretations. Responses were re- 
worded to simplify any difficult language. Ta- 
ble 3 shows samples of the metaphors and 
interpretations used. 


Design and Procedure 

In both the choice task and the rating 
task, the key factors were age (3 levels), meta- 
phor type (R or D, with A considered sepa- 
rately), and interpretation type (relational or 
attributional). For adults, the tasks were ad- 
ministered in written form to two separate 
groups of 18 subjects. For the choice task, 
each metaphor was given with its two inter- 
pretations and subjects indicated which inter- 
pretation they preferred. For the rating task, 


each metaphor was given on a page with its 
two interpretations, and subjects were asked 
to rate how well they liked each interpreta- 
tion on a 1-5 scale, with 5 being “very inter- 
esting” and 1 being “boring.” In both tasks, 
the list of metaphors was presented twice, 
once with each set of interpretation pairs. 
Two different orders were used, one the re- 
verse of the other. 


Children were tested individually in 
their homes. They received the choice task 
followed by the rating task. A quiz-show for- 
mat was used to simplify the task. Two stuffed 
dolls were used as quiz show contestants. 
Each doll had a canister to collect its win- 
nings. The child was told that Pink Panther 
and Tweetie Bird were competing on a quiz 
show, and that it was the child’s job to indi- 
cate which doll had given the best answer by 
dropping a token into the winning doll’s can- 
ister. To be sure children understood the for- 
mat, each child completed two practice tasks 
before starting the actual experiment. 


Practice tasks—The first practice task 
was picture labeling. The experimenter held 
up a picture of a familiar animal and ex- 
plained that the goal of the contestants was to 
give its common name (e.g., “elephant’”). One 
doll produced the correct answer and the 
other a clearly incorrect answer. The child in- 
dicated which doll was correct by dropping a 
token in that doll’s canister. In this and all 


succeeding tasks, each doll was correct half 
the time, in a semirandom predetermined or- 
der. The second practice task was sentence 
interpretation. A simple sentence was read 
and each doll gave an interpretation of the 
meaning of the sentence. One interpretation 
was always correct and the other clearly 
wrong. For example, the sentence might be 
“Santa Claus is on the roof” and the two inter- 
pretations: “There’s a jolly fat man on the 
roof” (correct), and “I had spaghetti for din- 
ner’ (wrong). The child indicated the correct 
answer in the manner described above. The 
criterion for passing the practice tasks was six 
correct in no more than 10 trials. In fact, most 
children achieved criterion in fewer than 
eight trials. Only one child—a 4-year-old— 
had to be dropped from the experiment due 
to failing the practice task. After the practice 
tasks, the experiment began with the choice 
task for metaphor interpretations. 


Choice task.—The experimenter read 
the “quiz question” metaphor: for example, 
“How is a cloud like a sponge?” Then the 
experimenter would point to each doll in tum 
and give the doll’s response: “He says, ‘Be- 
cause they’re both soft and fluffy’; and he 
says, ‘They both hold water.’ Whose answer is 
better?” The child would give a token to the 
winning doll. In al] cases, one doll gave an 
attributional interpretation and the other a re- 
lational interpretation, with the assignment 
varying such that each doll had equal num- 
bers of relational and attributional answers. 
Order of the relational and attributional inter- 
pretation was varied so that each occurred 
first half the time. After all 18 metaphors had 
been presented with the first set of interpreta- 
tions, they were presented again with the sec- 
ond set of interpretations. The order in which 
the relational and attributional interpretations 
was given on the second pass was reversed 
from the order on the first pass. 


Rating task.—The experimenter began 
by saying, “Now we're going to do something 
else. You picked which answer you liked bet- 
ter. Now you're going to tell me just how good 
you think each answer is.” Then the child 
was shown the “goodness meter,” a vertical 
scale standing about 2 feet high and 3 inches 
wide, with gradations running from 1 (lowest) 
to 5 (highest). The experimenter pointed to 
the numbers and described their use: “A 5 
means that the answer is really good. But a 1 
means that it’s really bad. You didn’t like the 
answer at all. Now 3 is in the middle. It 
means that the answer was OK.” When the 
child could demonstrate knowledge of the 
scale, the rating task began. The experimenter 
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read the “quiz question” metaphor (e.g., 
“How is acloud like a sponge?’”’) and gave the 
first doll’s response: “He says, “Because 
they’re both soft and fluffy.” How good is that 
answer?” After the child rated the answer on 
the goodness meter, the experimenter re- 
peated the metaphor and gave the second 
doll’s response, which the child again rated. 


Definition task.—At the end of the ses- 
sion, as a check on their knowledge, children 
were asked to define the words used in the 
metaphors. For younger children, all words 
were tested. For older children, only more 
difficult words (e.g., “submarine”) were 
tested. If children had difficulty with the ini- 
tial question (e.g., “What is a sponge?”), the 
question was changed to “What do you do 
with a sponge?” All children demonstrated 
adequate understanding of the words. 


Results and Discussion 

Choice task.—Relational responding in- 
creased with age, as shown in Figure 2. The 
proportions of relational responses for the R 
and D metaphors were .61, .69, and .89 for the 
4—5-year-olds, 7—8-year olds, and adults, re- 
spectively. A 3 (age) x 2 (metaphor type) 
mixed-measures analysis of variance on the 
proportion of relational responses for the R 
and D metaphors showed a main effect of age, 
confirming the developmental increase in re- 
lational preference, F(2,51) = 10.51, p < 
.0001. (However, all age groups showed 
above-chance relational responding [Kolgo- 
roff-Smirnov goodness-of-fit with chance at 
.50, all z’s > 1.50, p < .05].) There was also a 
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Fic. 2.—Results of Experiment 2: proportion 
of relational interpretations chosen for different 
types of metaphors across age. 
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main effect of metaphor type, reflecting a 
stronger relational preference on R metaphors 
(M = .77) than on D metaphors (M = .67), 
F(1,51) = 12.57, p < .001. The interaction be- 
tween age and metaphor type was not signifi- 
cant. 


Rating task.—There was an age increase 
in relational bias on the R and D metaphors. 
As shown in Figure 3, the mean rating of the 
relational interpretation increases with age, 
and the mean rating of the attributional inter- 
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Fic. 3.—Results of Experiment 2: mean rat- 
ings of relational and attributional interpretations 
for different types of metaphors across age. 


pretation decreases with age. These patterns 
were confirmed in a 3 (age) x 2 (metaphor 
type) X 2 (interpretation type) mixed-mea- 
sures analysis of variance over the R and D 
metaphors. As predicted, there was a main 
effect of interpretation type, F(1,51) = 95.88, 
p < .0001, confirming that ratings were higher 
for relational interpretations than attributional 
interpretations. There was also a main effect 
of age, reflecting higher overall ratings for the 
7—8-year-olds than for the other two groups, 
F(2,51) = 6.27, p < .01. The expected interac- 
tion of age xX interpretation type was signifi- 
cant, F(2,51) = 30.83, p < .0001, reflecting 
both the rise in ratings for the relational inter- 
pretations and the drop in ratings for the at- 
tributional interpretations with age. There 
was also a metaphor x interpretation interac- 
tion, F(1,51) = 13.94, p < .001, indicating a 
somewhat greater relational preference on the 
R metaphors than on the D metaphors. There 
was no main effect of metaphor type and no 
other significant interactions. 


As shown in Figure 3, relational interpre- 
tations become steadily more valued with age 
and attributional interpretations become less 
valued. Planned comparisons confirmed that 
the older groups rated relational interpreta- 
tions significantly higher than attributional in- 
terpretations. For R metaphors, the difference 
was significant for both 7—8-year-olds and 
adults, but not for the 4—5-year-olds, (17) = 
5.50, p < .0001; t(17) = 10.17, p < .0001; and 
t(17) = 1.33, p > .05, respectively. For D 
metaphors, only the adults showed a signifi- 
cant difference, t(17) = 9.59, p < .0001; nei- 
ther the 4—5-year-olds nor the 7—8-year-olds 
showed a significant relational preference on 
D metaphors, t(17) < 1, and t(17) = 1.89, p< 
.08, respectively. 


Attributional metaphors.—On the A 
metaphors, subjects showed a clear prefer- 
ence for the attributional responses, as ex- 
pected. In the choice task, all groups chose 
the attributional response more often than the 
relational response on the two A metaphors 
(see Fig. 2). Similarly, in the rating task all 
age groups rated the attributional response 
higher than the relational response on the two 
A metaphors (see Fig. 3). This difference was 
significant for 7~8-year-olds and adults, ¢(17) 
= -2.71, p < .05, and ¢(17) = -4.91, p< 
.0001, respectively, although not for 4—5-year- 
olds, t(17) = —1.81, p < .09. Performance on 
the A metaphors helps rule out some artifac- 
tual explanations of the results. The apparent 
relational preference on R and D metaphors 
might have been based on some response 
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bias, such as a preference for longer? or more 
abstract-sounding interpretations. However, 
the subjects’ preference for attributional inter- 
pretations on the A metaphors tends to indi- 
cate that their preference for relational inter- 
pretations on the R and D metaphors was a 
genuine interpretational choice. 


General Discussion 


The results of the two experiments show 
a developmental increase in relational focus 
in the interpretation of metaphors. As in prior 
research (Gentner, 1986), adults showed a 
strong relational bias. In Experiment 1, they 
generated relational interpretations whenever 
possible, and they rated metaphors as more 
apt the more relational commonalities they 
found and less apt the more attributional com- 
monalities they found. In Experiment 2, on 
both the choice task and the rating task, adults 
showed a clear preference for relational inter- 
pretations for relational and double meta- 
phors. 


This relational focus developed gradu- 
ally. In Experiment 1, the relationality of the 
interpretations produced for relational and 
double metaphors increased steadily from 6 
years to adulthood. Experiment 2 showed the 
same developmental pattern for comprehen- 
sion. Both on the choice task and on the rat- 
ings task, there was a developmental shift to- 
ward preferring relational interpretations over 
attributional interpretations of relational and 
double metaphors. In contrast, on the attribu- 
tional metaphors, all three age groups pre- 
ferred attributional interpretations both in 
production and in comprehension; there was 
no developmental change whatsoever. This 
indicates (1) that the developmental shift is 
specifically relational and not simply the re- 
sult of some general increase in verbiage with 
age, and (2) that not all metaphors are alike 
developmentally. Indeed, young children 
seem to deal with attribute matches in much 
the same way as adults. What develops is rela- 
tional similarity. 

The notion of a relational shift helps clar- 
ify the conflicting findings in the develop- 
ment of metaphor. For example, the paradox 
of production before comprehension may ad- 
mit of the following resolution: children both 
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produce and comprehend attributional meta- 
phors very early, but relational matches come 
later. Because comprehension tasks typically 
include both kinds of metaphors, children do 
poorly in these tasks well after they can spon- 
taneously produce attributional comparisons. 
A supporting observation is that most of 
the early spontaneous metaphors appear to 
be based on perceptual attribute overlap, as 
in Bowerman’s “moon-hangnail” example 
quoted above. In Winner’s (1979) study of the 
spontaneous metaphors of one child (Adam) 
from age 2-3 to 4-10, the most frequent basis 
for metaphorical comparison was the shared 
object attribute of contour: for example, re- 
ferring to a pencil as a “big needle.” Rela- 
tional metaphors occasionally occurred, espe- 
cially during pretending games (e.g., at 3-3 
Adam placed a small alphabet letter on top of 
a big one and said, “Adam sleeping on 
Daddy”), but they were rare. 


There is some prior research supporting a 
relational shift. In the study by Billow (1975) 
discussed above, metaphors based on shared 
attributes (e.g., “He had a pickle for a nose”) 
were comprehended well before “propor- 
tional metaphors,”” based on shared relations 
(e.g., “The sun wakes the seeds”). The rela- 
tional shift is also compatible with Keil’s pro- 
posed shift from characteristic to defining 
features in category judgments (Keil & Batter- 
man, 1984). Typically, characteristic features 
are object attributes, such as the shape of an 
animal, and defining features are relations, 
such as the causal regularity that parents pro- 
duce children of the same species as them- 
selves. Another antecedent is Bruner’s pro- 
posal that children move from relying on 
perceptual-configural information to relying 
on functional information (Bruner, Olver, 
& Greenfield, 1966). 


The notion of a relational shift allows a 
more orderly account of the development of 
metaphor. But what underlies this shift? At 
least three possibilities are suggested by the 
literature. The first possibility is that it reflects 
an increase in basic cognitive competence 
(Inhelder & Piaget, 1958; Pascual-Leone, 
1970). Thus, Billow (1975) interpreted his 
findings in terms of cognitive stages: similar 
(attributional) metaphors are early because 
they can be understood at the concrete opera- 


5 The relational responses were in general longer (in number of words) than the attributional 
responses, so a length bias would have inflated the number of relational responses. As another 
check on this possibility, | compared the results for four R and D metaphors that had the reverse 
pattern—attributional responses longer than relational responses—with the overall results for R and 
D metaphors. There were no systematic differences in the results. 
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tional level; proportional (relational) meta- 
phors are acquired later because they require 
formal operational thinking. The second pos- 
sibility is that the relational shift reflects chil- 
dren’s learning of adult pragmatic conven- 
tions concerning what to map. If children did 
not share the aesthetic that metaphors are 
supposed to be about shared relations, then 
their failure to perform relational mappings 
would not necessarily indicate a lack of com- 
petence. The third possibility is that the rela- 
tional shift results from accretion of domain 
knowledge; it is an instance of the novice- 
expert shift (Carey, 1984; Chi, Feltovich, & 
Glaser, 1981; Larkin, 1983; Siegler, 1976). By 
this account, children’s failure to perform re- 
lational mappings results not from cognitive 
deficiencies but from a lack of knowledge of 
the necessary domain relations. 


Although there is as yet no definitive evi- 
dence, there is one line of comparison that 
can differentiate among these three ap- 
proaches. The first two accounts—the cogni- 
tive competence account and the pragmatic 
insight account—both postulate a global shift 
that applies across domains. In contrast, the 
domain-knowledge account postulates that 
the relational shift will occur at different 
times in different domains. Moreover, it 
should occur earliest in those domains that 
are most accessible to young children. This 
position is supported by the fact that the few 
studies that have shown early ability to per- 
form relational mappings have used ex- 
tremely familiar domains (Brown & Campi- 
one, 1985; Crisafi & Brown, 1986; Gardner, 
1974; Gentner, 1977a, 1977b). For example, 
in an earlier study I found that very young 
children could interpret metaphors with body 
parts as the base domain (Gentmer, 1977a, 
1977b). Children were shown pictures of fa- 
miliar objects such as trees and asked ques- 
tions like, “If the tree had a knee where 
would it be?” Children as young as 4 could 
accurately map the familiar spatial relations 
among body parts, even when the task was 
made more difficult by turning the tree up- 
side down or by adding misleading details. 
Similarly, Crisafi and Brown (1986) found that 
preschoolers could learn to do a difficult com- 
bination-of-inferences problem by analogy 
with an easier problem, but only if the base 
problem was made up of extremely familiar 
objects and activities. 


By the domain-knowledge account, the 
relational shift should occur early for some 
domains and late for others. DeLoache’s 
(1985) work on search behavior provides in- 
triguing evidence for a very early relational 


shift in spatial matching ability. At 3-1, chil- 
dren can solve problems in which they see an 
object hidden in a furnished room and then 
look for a tiny replica of the object in a tiny 
replica of the room. But at 2-7, children fail 
this task; they can successfully find the object 
only if the two rooms are identical. Perhaps it 
is during this interval that understanding of 
spatial relations progresses to the point where 
children can match on the basis of relational 
structure without the support of object iden- 
tity. At the other end of the developmental 
spectrum, Sternberg and Downing (1982) 
found that adolescents go through some of the 
same stages of interpretation when dealing 
with complex analogies between analogies 
that younger children do with simple analo- 


gies. 


The domain-knowledge account of the 
relational shift still faces unanswered ques- 
tions. In order to account for the data, the do- 
main-knowledge account must presuppose 
that in every domain relational information is 
slower to be acquired than object-attribute in- 
formation. This may be a plausible intuition, 
but it has not been verified. A second prob- 
lem is specifying what constitutes a domain. 
Despite these unanswered questions, the do- 
main-knowledge account is the best current 
candidate for explaining the relational shift. 
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